# Instruction Fine-Tuning Optimization

Granite 3.3 8b Instruct Q8 0 GGUF
Apache-2.0
This model is a GGUF format model converted from the IBM Granite-3.3-8B instruction fine-tuned model, suitable for text generation tasks.
Large Language Model
G
NikolayKozloff
36
2
Opencodereasoning Nemotron 7B
Apache-2.0
OpenCodeReasoning-Nemotron-7B is a large language model developed based on Qwen2.5-7B-Instruct, focusing on code generation and reasoning tasks, supporting a context length of 32K tokens.
Large Language Model Transformers Supports Multiple Languages
O
nvidia
5,410
30
Llama SEA LION V3.5 70B R
Llama-SEA-LION-v3.5-70B-R is a hybrid-function large language model optimized for Southeast Asian languages, supporting 13 languages with capabilities in complex reasoning and general text generation.
Large Language Model Transformers Supports Multiple Languages
L
aisingapore
2,406
1
Flan T5 Titlegen Springer
MIT
A model fine-tuned based on google/flan-t5-base, specifically designed for the abstractive summarization task of refining scientific abstracts into concise titles.
Text Generation Transformers English
F
tiam4tt
236
0
Qwen.qwen2.5 VL 3B Instruct GGUF
Qwen2.5-VL-3B-Instruct is a 3B-parameter vision-language model that supports image-to-text generation tasks.
Image-to-Text
Q
DevQuasar
1,107
3
Llama 3.1 8B SuperNova EtherealHermes GGUF
Apache-2.0
An 8B-parameter large language model based on the Llama-3.1 architecture, offering multiple quantized versions in GGUF format
Large Language Model English
L
tensorblock
44
1
T3Q Qwen2.5 14b V1.0 E3
Apache-2.0
A post-trained version based on the Qwen/Qwen2.5-14B-Instruct-1M model, using LoRA-8-4-0.0001-cosine-32-16 configuration, trained on train_data_v1.0.
Large Language Model Transformers Supports Multiple Languages
T
JungZoona
1,557
25
Hymba 1.5B Instruct
Other
A 1.5B-parameter model fine-tuned for instructions based on Hymba-1.5B-Base, capable of handling complex tasks such as mathematical reasoning, function calling, and role-playing
Large Language Model Transformers
H
nvidia
3,547
227
Videollama2.1 7B 16F Base
Apache-2.0
VideoLLaMA2.1 is an upgraded version of VideoLLaMA2, focusing on enhancing spatiotemporal modeling and audio understanding capabilities in large video-language models.
Video-to-Text Transformers English
V
DAMO-NLP-SG
179
1
Videollama2.1 7B 16F
Apache-2.0
VideoLLaMA 2 is a multimodal large language model focused on video understanding, equipped with spatiotemporal modeling and audio comprehension capabilities.
Text-to-Video Transformers English
V
DAMO-NLP-SG
2,813
10
Llama 3.1 8B Dragonfly V2
Dragonfly is a multimodal vision-language model fine-tuned with instructions based on Llama 3.1, supporting joint understanding and generation of images and text
Image-to-Text English
L
togethercomputer
113
1
Mistral 7B V0.3
Apache-2.0
Mistral-7B-v0.3 is an upgraded large language model based on Mistral-7B-v0.2, with the main improvement being the expansion of the vocabulary to 32,768 tokens.
Large Language Model Transformers
M
mistralai
442.55k
472
Llama 3 Stinky V2 8B
Other
This is an 8B-parameter model based on the Llama-3 architecture, merged using the mergekit tool, with strong text generation capabilities.
Large Language Model Transformers
L
nbeerbower
39
5
Granite 8b Code Instruct 4k
Apache-2.0
Granite-8B-Code-Instruct-4K is an 8-billion-parameter code instruction model, fine-tuned on various permissible instruction datasets based on Granite-8B-Code-Base-4K, enhancing its ability to follow instructions, including logical reasoning and problem-solving skills.
Large Language Model Transformers Other
G
ibm-granite
1,481
110
Granite 3b Code Instruct 2k
Apache-2.0
Granite-3B-Code-Instruct-2K is a 3-billion-parameter model fine-tuned from Granite-3B-Code-Base-2K, with enhanced instruction-following capabilities, particularly excelling in code generation and logical reasoning tasks.
Large Language Model Transformers Other
G
ibm-granite
1,883
36
Turkcell LLM 7b V1
Apache-2.0
A Turkish large language model based on the Mistral 7B architecture, trained on 5 billion Turkish tokens and fine-tuned with instructions
Large Language Model Transformers Other
T
TURKCELL
3,771
88
Calme 7B Instruct V0.9
Apache-2.0
Calme-7B is a 7-billion-parameter language model fine-tuned based on Mistral-7B, excelling in generating clear, peaceful, and coherent text.
Large Language Model Transformers
C
MaziyarPanahi
25
10
Gemma 1.1 2b It
Gemma is a lightweight open model series launched by Google, built on the same technology as Gemini, suitable for various text generation tasks.
Large Language Model Transformers
G
google
71.01k
158
Codellama 70b Instruct Hf
Code Llama is a series of code generation and understanding models released by Meta, ranging from 7 billion to 70 billion parameters. This model is the 70 billion parameter instruction fine-tuned version.
Large Language Model Transformers Other
C
meta-llama
505
18
14B DPO Alpha
CausalLM/14B-DPO-α is a large-scale causal language model supporting Chinese and English text generation tasks, with outstanding performance in MT-Bench evaluations.
Large Language Model Transformers Supports Multiple Languages
1
CausalLM
172
118
Finma 7b Nlp
MIT
FinMA-7B-NLP is a large language model for the financial domain developed by the PIXIU project, specifically designed to understand complex financial terms and concepts, significantly improving performance in downstream financial tasks through natural language instruction fine-tuning.
Large Language Model Transformers English
F
ChanceFocus
575
11
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase